Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers
نویسندگان
چکیده
Cooperative Multi-agent Reinforcement Learning (CMARL) has shown to be promising for many real-world applications. Previous works mainly focus on improving coordination ability via solving MARL-specific challenges (e.g., non-stationarity, credit assignment, scalability), but ignore the policy perturbation issue when testing in a different environment. This hasn't been considered problem formulation or efficient algorithm design. To address this issue, we firstly model as Limited Policy Adversary Dec-POMDP (LPA-Dec-POMDP), where some coordinators from team might accidentally and unpredictably encounter limited number of malicious action attacks, regular still strive intended goal. Then, propose Robust Multi-Agent Coordination Evolutionary Generation Auxiliary Adversarial Attackers (ROMANCE), which enables trained diversified strong auxiliary adversarial attacks during training, thus achieving high robustness under various perturbations. Concretely, avoid ego-system overfitting specific attacker, maintain set attackers, is optimized guarantee attackers attacking quality behavior diversity. The goal minimize effect, novel diversity regularizer based sparse applied diversify behaviors among attackers. then paired with population selected maintained attacker set, alternately against constantly evolving Extensive experiments multiple scenarios SMAC indicate our ROMANCE provides comparable better generalization than other baselines.
منابع مشابه
Multi-Agent Mission Coordination via Negotiation
This paper is intended to give an intuitive overview of the operations of MASA-CIRCA, the Multi-Agent SelfAdaptive Cooperative Intelligent Real-Time Control Architecture. While individual CIRCA agents have been under development for some time, we have only recently begun developing the architecture’s multiagent capabilities. This paper briefly describes the high-level negotiation functions that...
متن کاملCoordination of an Asynchronous Multi-agent System via Averaging
This paper is concerned with the coordination of a group of n > 1 mobile autonomous agents which all move in the plane with the same speed but with different headings. Each agent updates its heading from time to time to a new value equal to the average of its present heading and the headings of its current “neighbors”. Although all agents use the same rule, individual updates are executed async...
متن کاملIntegrated Multi- Agent Coordination
Much work in multi-agent systems focuses on coordinating the activities of agents so that the end result approximates the solutions possible if one were to centralize the activities being carried out by these agents. Research into coordination has taken different forms: negotiation, scheduling, planning, organizational approaches, etc. Multi-agent coordination, defined as managing inter-depende...
متن کاملAdversarial Behavior in Multi-agent Systems
Adversariality of the agents with respect to the multi-agent system can be a serious issue in the design of open multi-agent systems. Until now, many incoherent definitions of such behavior were used, preventing the consolidation of the knowledge about the domain. By basing ourselves on the valid and accepted results from economics, law and conflict theory, we propose a consistent definition of...
متن کاملMulti-Agent Diverse Generative Adversarial Networks
We propose an intuitive generalization to the Generative Adversarial Networks (GANs) and its conditional variants to address the well known mode collapse problem. Firstly, we propose a multi-agent GAN architecture incorporating multiple generators and one discriminator. Secondly, to enforce different generators to capture diverse high probability modes, we modify discriminator’s objective funct...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i10.26388